Goto

Collaborating Authors

 atrous spatial pyramid pooling


CASCRNet: An Atrous Spatial Pyramid Pooling and Shared Channel Residual based Network for Capsule Endoscopy

Srinanda, K V, Prabhu, M Manvith, Lal, Shyam

arXiv.org Artificial Intelligence

This manuscript summarizes work on the Capsule Vision Challenge 2024 by MISAHUB. To address the multi-class disease classification task, which is challenging due to the complexity and imbalance in the Capsule Vision challenge dataset, this paper proposes CASCRNet (Capsule endoscopy-Aspp-SCR-Network), a parameter-efficient and novel model that uses Shared Channel Residual (SCR) blocks and Atrous Spatial Pyramid Pooling (ASPP) blocks. Further, the performance of the proposed model is compared with other well-known approaches. The experimental results yield that proposed model provides better disease classification results. The proposed model was successful in classifying diseases with an F1 Score of 78.5% and a Mean AUC of 98.3%, which is promising given its compact architecture.


EWasteNet: A Two-Stream Data Efficient Image Transformer Approach for E-Waste Classification

Islam, Niful, Jony, Md. Mehedi Hasan, Hasan, Emam, Sutradhar, Sunny, Rahman, Atikur, Islam, Md. Motaharul

arXiv.org Artificial Intelligence

Improper disposal of e-waste poses global environmental and health risks, raising serious concerns. The accurate classification of e-waste images is critical for efficient management and recycling. In this paper, we have presented a comprehensive dataset comprised of eight different classes of images of electronic devices named the E-Waste Vision Dataset. We have also presented EWasteNet, a novel two-stream approach for precise e-waste image classification based on a data-efficient image transformer (DeiT). The first stream of EWasteNet passes through a sobel operator that detects the edges while the second stream is directed through an Atrous Spatial Pyramid Pooling and attention block where multi-scale contextual information is captured. We train both of the streams simultaneously and their features are merged at the decision level. The DeiT is used as the backbone of both streams. Extensive analysis of the e-waste dataset indicates the usefulness of our method, providing 96% accuracy in e-waste classification. The proposed approach demonstrates significant usefulness in addressing the global concern of e-waste management. It facilitates efficient waste management and recycling by accurately classifying e-waste images, reducing health and safety hazards associated with improper disposal.


The DeepLab Family

#artificialintelligence

Image segmentation tasks have seen lots of developments in recent years, and have become one of the most researched topics in Computer Vision⁶. One of the standards for segmentation is represented by the Deep Labelling for Image Segmentation architecture, also known as DeepLab. The approach was developed by Chen et al.¹ ² ³ ⁴ and different versions employing different mechanisms were proposed over time. In this article, a brief overview of the different DeepLab algorithms and their basic functioning will be given. The first appearance of the DeepLab architecture is found in [1].